Nonlinear Manifold Learning for Visual Speech Recognition
نویسندگان
چکیده
A technique for representing and learning smooth nonlinear manifolds is presented and applied to sev eral lip reading tasks. Given a set of points drawn from a smooth manifold in an abstract feature space, the technique is capable of determining the structure of the surface and of finding the closest manifold point to a given query point. We use this technique to learn the "space of lips" in a visual speech recognition task. The learned manifold is used for tracking and extracting the lips, for interpolating between frames in an image se quence and for providing features for recognition. We describe a system based on Hidden Markov Models and this learned lip manifold that significantly improves the performance of acoustic speech recognizers in degraded environments. We also present preliminary results on a purely visual lip reader.
منابع مشابه
بهبود مدل تفکیککننده منیفلدهای غیرخطی بهمنظور بازشناسی چهره با یک تصویر از هر فرد
Manifold learning is a dimension reduction method for extracting nonlinear structures of high-dimensional data. Many methods have been introduced for this purpose. Most of these methods usually extract a global manifold for data. However, in many real-world problems, there is not only one global manifold, but also additional information about the objects is shared by a large number of manifolds...
متن کاملUsing Locality Sensitive Hashing for Fast Computation of Correlational Manifold Learning based Robust Features
This paper considers the application of a random projections based hashing scheme, known as locality sensitive hashing (LSH), for fast computation of neighborhood graphs in manifold learning based feature space transformations in automatic speech recognition (ASR). Discriminative manifold learning based feature transformations have already been found to provide significant improvements in ASR p...
متن کاملFace recognition based on manifold learning and Rényi entropy
Though manifold learning has been successfully applied in wide areas, such as data visualization, dimension reduction and speech recognition; few researches have been done with the combination of the information theory and the geometrical learning. In this paper, we carry out a bold exploration in this field, raise a new approach on face recognition, the intrinsic α-Rényi entropy of the face im...
متن کاملA Geometric Perspective on Speech Sounds
In order to effectively approach high dimensional pattern recognition problems, one seeks to understand and exploit any inherent low dimensional structure. Recently, a number of manifold learning algorithms have been motivated by a geometric point of view that models high dimensional data as lying near a low dimensional submanifold of the original space. Our paper has two main goals: (i) to inv...
متن کاملNonlinear Image Interpolation using Manifold Learning
The problem of interpolating between specified images in an image sequence is a simple, but important task in model-based vision. We describe an approach based on the abstract task of "manifold learning" and present results on both synthetic and real image se quences. This problem arose in the development of a combined lip-reading and speech recognition system.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995